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A HIGH RESOLUTION TYPING SYSTEM FOR PATHOGENIC 
MYCOBACTERIUM TUBERCULOSUM 

5 CLAIM TO DOMESTIC PRIORITY 

[0001] This application claims the benefit of priority to US Provisional 

application Serial No.60/3 97,224 entitled "A High Resolution Typing System For 
Pathogenic Mycobacterium tuberculosum" filed July 19, 2002, by Paul S. Keim et al., 
and is herein incorporated by reference in its entirety. 

1 0 FIELD OF THE INVENTION 

[0002] The present invention is generally directed to sub-typing Mycobacterium 

tuberculosum, the causative agent of tuberculosis, and is more specifically directed to 
PCR amplification of variable number tandem repeat sequences (VNTR) with primer 
pairs designed to amplify specifically certain VNTR identified in Mycobacterium 

15 tuberculosum isolates. Results of the analysis may be compared to results from known 
Mycobacterium species to determine the sub-type of the species for epidemiological and 
diagnostic purposes. 

BACKGROUND OF THE INVENTION 

20 [0003] Tuberculosis (TB) is the primary cause of death due to a single microbial 

pathogen, accounting for 2 to 3 million deaths and an estimated 8 to 10 million new cases 
per year worldwide (1). It is estimated that one third of the world's population is infected 
with Mycobacterium tuberculosum. The global epidemic is growing, concomitant with 
HIV co-infection and the increasing prevalence of drug resistant strains. Genotyping 

25 clinical isolates of M. tuberculosum has the potential to significantly impact both clinical 
practice and public health. Clinically, determining strain relationships can distinguish 
treatment failure versus potential laboratory cross-contamination, and reactivation of 
drug-resistant disease versus infection with another strain (24). In the public health arena, 
determining strain relationships during the course of a tuberculosis outbreak can focus 

30 public health resources. However, the most discriminatory M. tuberculosum genotyping 
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method, IS61 10 restriction fragment length polymorphism (RFLP) analysis, is time- 
consuming, technically demanding, and difficult to standardize between laboratories (3). 
In addition, IS61 10 RFLP does not provide sufficient strain discrimination when fewer 
than six IS61 10-hybridizing bands are present. A secondary genotyping method is 
5 required in these instances. A number of PCR-based genotyping methods have been 
developed, but are limited in their ability to reproducibly provide a level of strain 
discrimination that matches associations defined by traditional epidemiology. In 
addition, the high degree of coding region sequence conservation between strains of M. 
tuberculosum virtually excludes the use of more commonly used population genetics 
1 0 methods like multi-locus enzyme electrophoresis, restriction fragment length end labeling 
analysis, and gene sequencing (12). 

[0004] Spoligotyping is currently the most widely used PCR-based method of M 

tuberculosum genotyping due to the relative technical ease of the protocol and the ability 
to compare results between laboratories using a standardized nomenclature system (6). 

1 5 However, the discriminatory power of this method is low. In a database of 3,3 1 9 

spoligotypes from isolates collected in geographically distinct areas, seven spoligotype 
patterns were found for 37% of all clustered isolates (18). These spoligotype patterns are 
associated with strains having a much larger variety of IS61 10 RFLP patterns. For 
instance, 18% of isolates in the spoligotype database had the "Beijing" spoligotype 

20 pattern, characterized by the presence of spacer oligonucleotides 39 through 43 (18). 
However, although the W-Beijing family of strains is recognized as one of the most 
homogeneous families of M tuberculosum strains (19), the Beijing spoligotype is 
associated with at least 450 distinct strains as determined by IS61 10 RFLP analysis (2). 

[0005] Following the public release of the complete genomic sequences of M 

25 tuberculosum strains H37Rv and CDC- 1551, investigators began the process of 

identifying repetitive genetic elements to develop higher resolution DNA typing systems. 
Variable number tandem repeats (VNTR) of 21-1 1 1 bp genetic elements have been 
identified (9, 14, 17, 20, 21). A set of 12 VNTRs, also known as mycobacterial 
interspersed repeat units (MIRUs), was more discriminatory than IS61 10 RFLP when 
30 used to analyze 180 M, tuberculosum isolates with 6 or fewer IS61 10-hybridizing bands. 
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The combination of MIRU analysis and spoligotyping and IS61 10 RFLP analysis has 
provided maximum specificity (5). 

[0006] However, a greater discriminatory power for resolution and differentiation 

5 of species is necessary for sub-typing Mycobacterium Xuberculosum species in order to 
track sources of infection and ultimately to prevent the spread of disease. Methods and 
means for determining the genetic differences between M. tuberculosum species with 
speed, accuracy and with great discriminatory capacity have been sought. 

BRIEF DESCRIPTION OF THE FIGURES 

10 

[0007] FIG. 1 . is a genomic survey location of MLVA and MIRU markers. The 

location of nine MLVA and 12 MIRU markers are shown on the H37Rv genome. The 
number in brackets indicates the location of the 5' forward primer for each marker. All 
accession numbers of forward primer locations are given using the H37Rv sequence. 

15 

[0008] FIG. 2 is a picture of the gel pattern from electrophoresis of MLVA 

fragments illustrating the fluorescent image of markers Mtb-vl,-v3,-v5, and -vlO on an 
ABI377 electrophoresis gel. Each marker allele is identified through a unique size and 
color combination, allowing adequate discrepant ability across comparable-sized 
20 fragments from sundry alleles. 

[0009] FIG. 3. is a dendrogram illustrating grouping resulting from UPGMA 

analysis of spoligotyping (A) and MLVA (B, C) data. Clustering using spoligotyping 
data across 91 isolates is shown (A). The group SP-A is broken into five genotypes using 
25 MLVA (B). For section C, MLVA resolves the Beijing cluster into seven genotypes, 
designated MLVA-BJ-A through BJ-G. 
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[0010] The present invention discloses methods and means for detecting and sub- 

typing M tuberculosum species by multi-locus VNTR analysis (MLVA) genotyping 
5 system of VNTR identified within the genome of M. tuberculosum In the system, 

independent, tandemly-inserted repeated motifs in the M. tuberculosum genome are amplified 
using fluorescently-labeled primers in multiplexed PCRs and electrophoretically separated on 
polyacrylamide gels for analysis. 

[0011] VNTR loci located throughout the H37Rv genome have been identified. Isolated 

10 nucleic acids having the sequence of VNTRs identified in M tuberculosum are presented. 
In an important aspect of the present invention nucleic acids comprising at least 12, 15, 
1 8 or total consecutive nucleotides of a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3; SEQ ID NO: 4; SEQ ID 
NO: 5; SEQ ID NO: 6; SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9 SEQ ID NO: 10 
15 SEQ ID NO: 11 SEQ ID NO: 12 SEQ ID NO: 13 SEQ ID NO: 14 SEQ ID NO: 15 SEQ 
ID NO: 16 SEQ ID NO: 17 and SEQ ID NO: 18 and sequences complementary thereto 
are presented. 

[0012] In certain preferred embodiments of the invention, these nucleic acids are 

immobilized on a solid surface and are useful, for example, in the detection of a M 
20 tuberculosum species in an assay employing probes, including, but not limited to, a nano- 
detection device. 

[0013] In another important aspect of the invention, primer pairs comprising a 

forward and a reverse primer are presented for amplification of VNTR located in DNA 
from a M tuberculosum species. Primer pairs suitable for PCR amplification of VNTR, 

25 by MLVA or by multiplex, may be selected from the group consisting of SEQ ID NO: 1 
and 2, SEQ ID NO: 3 and 4, SEQ ID NO: 5 and 6, SEQ ID NO: 7 and 8 SEQ ID NO: 9 
and 10, SEQ ID NO: 1 1 and 12, SEQ ID NO: 13 and 14, SEQ ID NO: 15 and 16, and 
SEQ ID NO: 17 and 18. Certain preferred primer pairs have, in addition, an observable 
group whereby amplified product may be detected. Such groups may be, for example, a 

30 fluorescent group or a radioactive group. 
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[0014] In yet another important aspect of the invention, a method for detecting a 

M. tuberculosum species is presented. The method comprises the steps of: 

i. obtaining a DNA sample from said species, 

ii. amplifying a VNTR marker loci in said DNA with one or more 
primer pairs; and 

iii. detecting an amplification product that contains the VNTR 
sequence. 

[0015] In another important aspect of the invention, MLV A methods are 

presented for observing polymorphisms at VNTR loci in DNA from more than one M 
tuberculosum species to resolve unique genotypes between the species and to allow sub- 
typing of the species into distinct groups. These MLVA methods provide a convenient 
and rapid method for strain discrimination in M. tuberculosum. MLVA may be applied 
for strain discrimination among globally diverse M. tuberculosum isolates. 

[0016] The method of sub-typing a M. tuberculosum strain comprises the steps of: 

i. obtaining DNA from said strain; 

ii. amplifying said DNA with one or more primer pairs selected from the 
group consisting of SEQ ID NOS: 1-8 and 9; 

iii. detecting said amplified product; 

iv. determining the diversity number of said amplified product; and 

v. comparing said diversity number with the diversity number for a 
known strain of M tuberculosum. 

The method may further comprise MIRU analysis for further discrimination of species. 

[0017] In yet another important aspect of the invention, kits are provided for 

detecting and sub-typing M tuberculosum species. The kits comprise one or more primer 
pairs suitable for amplifying VNTR in DNA in a sample of said species and may 
comprise, in addition, nucleic acids, enzymes, tag polymerase, for example, and buffers 
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suitable for causing amplification by PCR, by MLVA or by multiplex, for example. In 
certain preferred embodiments of the kit the primers comprise a label whereby amplified 
VNTR may be detected. In other preferred embodiments of the kit, labeled nucleic acids 
are provided. Observable labels are preferably fluorescent molecules or radionucleotides. 
5 Certain kits comprise in addition primers and reagents for MIRU analysis of a DNA 
isolate. 

DETAILS OF THE INVENTION 

[0018] MLVA methods are presented for rapid genotyping of M tuberculosum 

10 MLVA targets described herein provide discriminatory power that enhances the ability of 
present methods to determine rapidly molecular relationships of M tuberculosum 
isolates. A system comprising a combination of VNTR and MIRU analyses is provided 
for rapid genotyping of M tuberculosum. 

1 5 [0019] This MLVA typing system is a PCR-based method for genotyping M. 

tuberculosum using VNTR loci identified in the present invention. This PCR-based 
typing has advantages not present in other PCR-sy stems: rapid turnaround, amplification 
with crudely isolated or minute amounts, of DNA, and the ability to use nonviable 
samples which can be transported across various labs with greater ease than viable ones. 

20 The rapid typing system using nine VNTR loci has been used to group a collection of 57 
isolates of limited diversity (all had 4 IS61 10 bands), and 34 isolates from the Beijing 
strain family. Smaller repeat motifs, for example <50 bp, as credible additions to the 
molecular typing of M tuberculosum may also be used. In the MLVA system fluorescent 
labeling of primer and size of each allele allow for unique and easy identification of each 

25 VNTR. Automated gel analysis allows for a high throughput system that reduces error 

and time in obtaining results. High resolution electrophoresis and discrete repeats allows 
for the use of standardized data sets and facilitates central databases. An example of the 
database obtained is given in Table 4. 

[0020] The ultimate utility of VNTR loci lies in their diversity. The present 

30 invention discloses the use of marker diversity using both allele number and frequency to 
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sub-type Mycobacterium tuberculosi species. VNTR markers that exhibit high diversity 
values possess great discriminatory capacity for identifying genetically similar strains. 
Less diverse markers may be applied with greater utility in species identification and the 
analysis of evolutionary relationships. This demonstrated ability to predict VNTR 
5 diversity based upon array size allows the guided selection of marker loci. 

[0021] MLVA analysis using the VNTRs presented here can also contribute to the 

MIRUs recently developed by Mazars et al. (Mazars, E., S. Lesjean, A. L. Banuls, M. 
Gilbert, V. Vincent, B. Gicquel, M. Tibayrenc, C. Locht, and P. Supply 2001. High- 
resolution minisatellite-based typing is a portable approach to global analysis of 

10 Mycobacterium tuberculosum molecular epidemiology as described in Proceedings of the 
National Academy of Sciences of the United States of America. 98:1901-6)(14), herein 
incorporated in its entirety. The primers of the present invention may be labeled with the 
same fluorescent dyes as used in the MIRU set, and may be analyzed on the same 
electrophoretic run as the MIRUs because of their smaller size and allele composition. 

15 MIRU and Mtb markers may be incorporated into the same PCR multiplex reactions. 

The combination of markers and primers extends the power of the VNTR typing system, 
without additional time or labor, and increases the discriminatory ability of the present 
VNTR-typing. 

[0022] Molecular typing of M. tuberculosum using VNTR may produce a useful 

20 molecular clock. The "clocking" of a VNTR is a credible methodology for time- 
geographical correlation In MLVA, different rates of evolution for each VNTR loci may 
be observed. VNTR markers that match the type of epidemic we are tracking may be 
chosen. For example, cases of M. tuberculosum incidence in endemic areas would 
intuitively have little variation amongst markers of low diversity. The antithesis of that 
25 would be that more diverse markers would be poor for establishing phylogeny across a 
diverse isolate collection, which is akin to similar findings with IS61 10 elements (7). 
However, with MLVA it is possible to pick and choose what markers would be most 
appropriate for a given epidemic. For example, it is possible to deduce that VNTR loci 
with a low diversity index across a worldwide set would have an even lower or non- 
30 existent value across a local outbreak. This ability to create a wide gamut of markers of 
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sundry diversity is helpful in a geographically limited area where a particular genetic 
clone may be more prevalent. The intrinsic value of this concept supports the idea that 
some VNTR are not suitable for possible transmission routes, but better as informers of 
plausible evolutionary scenarios, and vice-versa (11, 22). 

5 [0023] Some VNTR markers demonstrate an affinity between actual genetic 

distance and the expansion or contraction of a repeat motif. The dendogram demonstrates 
this cascading trend with Mtb-vl. Within the Beijing collection, any cluster with a 0.1 1 
difference (one allele) from each other had a repeat motif that differed by only one repeat 
or allele. For example, MLVA-BJ-A and BJ-B differ by one marker score in Mtb-vl 0. 

10 Yet MLVA-BJ-C and BJ-D, which differ from MLYA-BJ-A and BJ-B by more than one 
marker allele, have an Mtb-vl allele size of 1 10, as opposed to 1 19 in MLVA-BJ-A and 
BJ-B. This indicates that phylogeny can be determined further by the difference in repeat 
motifs of any particular marker. Each marker gives two types of data: the raw score of 
allele size, and a relative allele relationship. This relative relationship is possible with 

15 MLVA, where the diversity within one marker can be greater than 0.5, whereas binary 
data can give a maximum diversity of 0.5 for any one locus. 

[0024] The dominance of a particular marker allele could prove a useful tool in 

quickly referencing a clonal dispersion is a helpful tool in phylogeny studies. The 
analysis of a distinctive allele type is only viable when multiple loci are combined in 
20 analyzing the data. By doing so, it is possible to curtail the effect of convergent evolution 
in the analysis. 

[0025] The most immediate public health application of this typing method will 

be its use during an outbreak. Determination of strain relationship between infected 
individuals could help in deducing routes of transmission. This information could save 
25 time and money for clinics and health-related professionals involved in eradication of M. 
tuber culosum. It is anticipated that this methodology will be a constructive addition to the 
molecular epidemiological methods currently used to track M. tuberculosum. 

The following definitions are used herein: 
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[0026] "Polymerase chain reaction" or "PCR" is a technique in which cycles of 

denaturation, annealing with primer, and extension with DNA polymerase are used to 
amplify the number of copies of a target DNA sequence by approximately 106 times or 
more. The polymerase chain reaction process for amplifying nucleic acid is disclosed in 
5 US Pat. Nos. 4,683,195 and 4,683,202, which are incorporated herein by reference. 

[0027] "Primer" is a single-stranded oligonucleotide or DNA fragment which 

hybridizes with a DNA strand of a locus in such a manner that the 3' terminus of the 
primer may act as a site of polymerization using a DNA polymerase enzyme. 

[0028] "Primer pair" is two primers including, primer 1 that hybridizes to a single 

1 0 strand at one end of the DNA sequence to be amplified and primer 2 that hybridizes with 
the other end on the complementary strand of the DNA sequence to be amplified. 

[0029] "Primer site" is the area of the target DNA to which a primer hybridizes. 

[0030] "Multiplexing" is a capability to perform simultaneous, multiple 

determinations in a single assay process and a process to implement such a capability in a 
1 5 process is a "multiplexed assay." Systems containing several loci are called multiplex 

systems described, for example, in US Patent No. 6,479,235 to Schumm, et al., US Patent 
No. 6,270,973 to Lewis, et al. and 6,449,562 to Chandler, et al. 

[0031] "Isolated nucleic acid" is a nucleic acid which may or may not be identical 

to that of a naturally occurring nucleic acid. When "isolated nucleic acid" is used to 

20 describe a primer, the nucleic acid is not identical to the structure of a naturally occurring 
nucleic acid spanning at least the length of a gene. The primers herein have been 
designed to bind to sequences flanking VNTR loci in Mycobacterium tuberculosum 
species. It is to be understood that primer sequences containing insertions or deletions in 
these disclosed sequences that do not impair the binding of the primers to these flanking 

25 sequences are also intended to be incorporated into the present invention. 

[0032] VNTRs useful in the methods of the present invention for detection and 

sub-typing Mycobacterium tuberculosum have been identified. Eighty-four regions 
containing tandem repeat sequences as potential VNTRs from the H37Rv sequence were 
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identified. The genomic location of nine identified VNTRs is illustrated in Figure 1. At 
least one VNTR marker was discovered in every Mbp region, and five were clustered in 
the first Mbp. Short sequence repeats (SSRs) that can be readily PCR amplified and 
molecularly sized with automated fluorescence-based instrumentation were identified. 

[0033] Table 2 gives the characteristics of these Mtb VNTRs,namely the VNTR 

loci identification number, amplicon size, size range* motif times repeat and VNTR motif 
within gene. Table 3 gives the allelic number and diversity of the VNTR across the data 
set. 



Table 2. Characteristics of Mtb VNTRs 4 . 



VNTR loci 


Amplicon size Size range 1 


Motif x repeat 2 VNTR motif within gene 


Mtb-vl 


113 


107-119 


3x7+1 


PonA - penicillin-resistance gene 


Mtb-v3 


152 


140-152 


11x3 


Rv0203 - hypothetical gene 


Mtb-v4 


146 


137-164 


9x4+2 


None - 5' adjacent to Rv0393 


Mtb-v5 


212 


196-230 


18x3-1 


hbhA - Heparin-binding hemagglutin gene 


Mtb-v6 


284 


278-290 


6x4+2 


Rv0996 - hypothetical gene 


Mtb-vlO 


164 


155-181 


9x3+6 


IppP- probable lipo protein-cell wall metabolism 


Mtb-vl5 


415 


278-591 


55x2-18, 79x3-133 


None - 5' adjacent to Ribonuclease III gene 


Mtb-vl 8 


136 


133-136 


3x4+2 


Rv3780 - unknown gene 


Mtb-v20 


560 


560-578 


18x2+4 


Rv0019c - conserved hypothetical gene 


ISize range given corresponds to range seen across all collections 




2Numbers with a + or 


- sign in front represent the number of addition nucleotides present/absent in a complete repeat 


3Non-tandem polymorphic repeats include: (8x6, and 12x2) 




4AII table data is collected and representative of findings from the H37Rv strain. 


Table 3. Allelic number and diversity of VNTR across data set. 












Diversity 


VNTR name 


Number of Alleles 


AU 


Beiiing Four-band IS6110 


Mtb-vl 


5 




0.59 


0.25 0.23 


Mtb-v3 


2 




0.06 


0.0 0.09 


Mtb-v4 


2 




0.08 


0.16 0.03 


Mtb-v5 


2 




0.02 


0.0 0.03 


Mtb-v6 


2 




0.02 


0.0 0.03 


Mtb-vlO 


4 




0.48 


0.16 0.30 


Mtb-vl5 


4 




0.14 


0.20 0.06 


Mtb-vl8 


2 




0.12 


0.20 0.03 


Mtb-v20 


2 




0.02 


0.06 0.0 
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The results of PCR assay of these repeats after electrophoretie separation is given 
in Figure 2. 

[0034] In this relatively homogenous set, diversity across the nine loci varied 

5 greatly, as measured by allele number range and diversity value range (Table 3). Six of 
the loci (Mtb-vl, v3, v5, v6, vlO, vl5) were variable across the collection of isolates 
having 4 IS61 10 bands, and six loci (Mtb-vl, v4, vlO, vl5, vl8, v20) were variable 
across the Beijing collection. In contrast, three VNTR loci (Mtb-vl, vlO, vl5) had four 
alleles and diversity values ranging from 0.14 to 0.59 across all samples. 

1 0 [0035] As observed in Table 3, a larger number of alleles did not always 

correspond to a greater diversity value. In one sense, allele number represents potential 
discrimination power while the diversity value represents the realized discrimination in a 
given set of samples. Both values are useful for understanding the genetic loci and the 
isolate sets being observed. 

15 [0036] When the 34 Beijing family strains were considered separately, the 

number of alleles and diversity values dropped. Six marker loci were informative within 
Beijing strains and three (Mtb-vl, vl5, vl8) were still relatively diverse, with diversity 
values of 0.25, 0.20, and 0.20, respectively. Markers Mtb-v4, vl5, and were more diverse 
across the Beijing collection than across the four-bander set (Table. 3). Diversity values 

20 for collection of isolates having 4 IS61 10 bands ranged from 0.0 to 0.3. The diversity 
values for Mtb-v5, v6, and v20 correspond to only one allele variation, in one isolate, 
across the entire collection set as illustrated in Table 4. 

[0037] The diversity values of Mtb-vl , and Mtb-vlO across the combined strain 

set were greater than for either strain set alone. This was due to differential allele 

25 frequency, as opposed to unique allele number, across each data set. The Mtb-vl size of 
1 16 bp had a frequency of 88% within the set of strains having four IS61 10 bands. In 
contrast, the allele size of 1 16 bp was seen on one occasion across the Beijing strain set, a 
2.9% frequency. For Mtb-vlO, a similar pattern of allele biasness can be seen. The allele 
size of 154 bp is seen in 84% of the isolates having 4 IS61 10 bands. In contrast, the allele 

30 size of 1 54 is not present at all in the Beijing strains. Twenty-nine (85%) of the Beijing 
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strains had an allele size of 163, whereas eight (14%) of the isolates having 4 IS 61 10 
bands had an allele size 163. Hence the higher diversity value across Mtb-vl and Mtb- 
vlO when the data is combined. 

[0038] The present invention provides primer pairs for PCR amplification of 

VNTR in DNA of M. tuberculosum. The primer pairs comprise a forward primer and a 
reverse primer. Table 1 illustrates the M tuberculosum Primer Sequences of the present 
invention. Table 1 also gives the source (location) of the VNTR. The location of each 
VNTR locus is given by the 5' of the forward primer in accordance with the H37Rv 
reference strain. 

Table 1. Name, Location, and sequence of primers 



Primer Location (kb)" 



Forward Primer Sequence 



Reverse Primer Sequence 



Mtb-vl 

Mtb-v3 

Mtb-v4 

Mtb-v5 

Mtb-v6 

Mtb-vlO 

Mtb-vl5 b 

Mtb-vl 8 

Mtb-v20 f 



55,481 
241,423 
472,658 
566,196 
1,122,852 
2,604,134 
3,239,432 
4,226,924 
23,420 



GTCGAACGAGACTTTCCCCAAACCGAC 

GATGACGGATCGTCGGGGGCGGGAAC 

GCTGTGGCGCAGCTACACAGTACGACTC 

GGAGGCGTTGGGTACGGTCGCATC 

CGCCGACGAGGCCGATGCCGAAGC 

CGAGG CGCCC AGCCCC ACAA 

GCGCCGCACCACCTCGACTT 

ACAACGGCGAGGCCCGAATCTACGAA 

CCCGGAGGGCCAGAGGGCACATAGC 



GACCGTGGGCTGGATGACGGTCTC 

GCAACGCGAGGGCGATCAGTACTGCCAACA 

GATTGCGCAGCGCCCAACAGC 

GATTCGGAGCCCGACTACTTCTGGGT 

CCGCGGCGGCAGAGCCAACCAGGAT 

CACCCGCGCTTTAGGATCGACACCTGA 

CCGGGCAAAACCTCCGCCTAAC 

GTCGACGCCGCCGATGACC 

TGGCGCAGAACCAGGAGTAGCACCAATGAG 



" Location of VNTR locus is given by the 5' of the forward primer in accordance with the H37Rv reference strain sequenced in 1998. 
b Mtb-vl5 is designed loci ETR-F, named and developed by Frothingham, and its similar location was identified retrospectively here (9). 
c Mtb-v20 was recently identified by Le Fleche(13), and given the name MtubOl. 



These primer sequences have herein been assigned SEQ ID NO: as follows: 
Table 4 SEQ ID NO Marker Name Primer Type 



SEQ ID NO: 1 
SEQ ID NO: 2 
SEQ ID NO: 3 
SEQ ID NO: 4 
SEQ ID NO: 5 
SEQ ID NO: 6 
SEQ ID NO: 7 
SEQ ID NO: 8 



Mtb-vl 



Mtb-vl 



Mtb-v3 



Mtb-v3 



Mtb-v4 



Mtb-v4 



Mtb-v5 



Mtb-v5 



Forward primer 
Reverse primer 
Forward primer 
Reverse primer 
Forward primer 
Reverse primer 
Forward primer 
Reverse primer 
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5 



15 



SEQ ID NO: 9 


Mtb-v6 


Forward primer 


SEQ ID NO: 10 


Mtb-v6 


Reverse primer 


SEQ ID NO: 1 1 


Mtb-vlO 


Forward primer 


SEQ ID NO: 12 


Mtb-vlO 


Reverse primer 


SEQ ID NO: 13 


Mtb-vl5 


Forward primer 


SEQ ID NO: 14 


Mtb-vl5 


Reverse primer 


SEQ ID NO: 15 


Mtb-vl8 


Forward primer 


SEQ ID NO: 16 


Mtb-vl8 


Reverse primer 


SEQ ID NO: 17 


Mtb-v20 


Forward primer 


SEQ ID NO: 18 


Mtb-v20 


Reverse primer 



20 [0039] The polynucleotides of the present invention may be prepared by two 

general methods: (1) they may be synthesized from appropriate nucleotide triphosphates, 
or (2) they may be isolated from biological sources. Both methods utilize protocols well 
known in the art. The availability of nucleotide sequence information enables 
preparation of an isolated nucleic acid molecule of the invention by oligonucleotide 

25 synthesis. Synthetic oligonucleotides may be prepared by the phosphoramidite method 
employed in the Applied Biosystems 38A DNA Synthesizer or similar devices. The 
resultant construct may be purified according to methods known in the art, such as high 
performance liquid chromatography (HPLC). Complementary segments thus produced 
may be annealed such that each segment possesses appropriate cohesive termini for 

30 attachment of an adjacent segment. Adjacent segments may be ligated by annealing 
cohesive termini in the presence of DNA ligase to construct an entire long double- 
stranded molecule. A synthetic DNA molecule so constructed may then be cloned and 
amplified in an appropriate vector 

[0040] Methods for using these primer pairs to amplify VNTR loci in 

35 Mycobacterium tuberculosum are disclosed herein. Generally MLVA analyses or 
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multiplex systems known to the art may be employed to detect and sub-types 
Mycobacterium tuberculosum. PCR instruments, for example, thermo-cyclers, used in 
these amplification methods are commercially available. 

[0041] Methods for applying MLVA of these VNTR for strain discrimination 

5 among M. tuberculosum isolates are disclosed. Polymorphisms at these loci were may 
be used to resolve genotypes into distinct groups. Among all 91 isolates, the nine-marker 
MLVA identified 17 unique genotypes. 12 unique MLVA genotypes are shown in Fig. 3, which 
were identified from 2 spoligotype clusters, Sp-A, and Sp-L. Spoligotype genotype 
cluster A was represented by five MLVA genotypes (Fig. 3b.). While the Beijing family 
10 isolates (34 total), were subdivided into seven unique types (Fig. 3c) based upon six 
informative marker loci (Table 2). Two of the isolates were unique, with the other 
isolates in clusters of 19, 5, 3, 3, and 2 (Fig. 3c). 

[0042] As observed in Table 3, a larger number of alleles did not always 

correspond to a greater diversity value. In one sense, allele number represents potential 
15 discrimination power while the diversity value represents the realized discrimination in a 
given set of samples. Both values are useful for understanding the genetic loci in any 
isolate being observed. 

[0043] The amplicon size range for 7 of the loci is 107-290 bp. This size range 

allows for electrophoresis runs of roughly 2 hours on the ABI377 system (Applied 
20 Biosystems Incorporated, Foster City, CA). Two markers (Mtb-vl5, Mtbv-20) have 
larger ranges from 278-591 bp (Table. 2). The number of alleles across these nine loci 
ranged from 2 to 8 (Table 3). 

[0044] All of the polymorphic markers, except Mtb-v20, contained three or more copies 

of a tandem repeat sequence in the H37Rv sequence. Of these, seven had repeat motifs of three 
25 base multiples (Table 2), consistent with "in-frame" insertion and deletion mutational events. 

Seven loci were within hypothetical or known ORFs, while two were adjacent to genes (Table 2). 

[0045] Figure 3 is a dendogram based upon Unweighted Pair Group Method 

Arithmetric Average (UPGMA) analysis of spoligotyping (A) and MLVA (B, C) data, 
illustrating the resolution of unique genotypes into distinct groups. Clustering using 
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spoligotyping data across all 91 isolates is shown (A). The group SP-A is broken into five 
genotypes using MLVA (B). For section C, MLVA resolves the Beijing cluster into 
seven genotypes, designated MLVA-BJ-A through BJ-G. 

[0046] The present MLVA methods provide high species discrimination and sub- 

typing of Mycobacterium tuber culosum. Spoligotyping differentiated the 91 isolates into nine 
unique genotypes (Fig. 3a). The spoligotyping dendogram revealed the deepest branch at 0.26, 
which would concur with the number of loci considered in UPGMA, which is 43. By definition, 
the Beijing strains share spoligotype pattern designation 000000000003771 (6). 

[0047] In addition IS61 10-based restriction fragment length polymorphism (RFLP) may 

be used in a sub-typing scheme. 

Table 5 illustrates clustering by these three methods. 
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^here cluster (CI.) designates a group of isolates with a distinct genotype defined by 
IS6110 RFLP, spoligotype, and multi-locus VNTR analysis; n is the number of isolates 
within the cluster; the IS6110 RFLP is numerically designated (Dsg.) and the image is 
shown with high to low molecular weights oriented from left to right; spoligotype 
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nomenclature as defined by Dale, 2001 (6); and Mtb-vl through Mtb-v20 indicate the 
amplicon size (number of base pairs) observed for the corresponding VNTR amplicons. 

[0048] The system of analysis has been applied to the globally prevalent 

5 homogeneous W-Beijing family of strains. Two sets of clinical isolates of M 

tuberculosum: a set of 34 Beijing strains and another set of 57 isolates having 4 IS61 10 
RFLP bands. The details of this analysis are presented in the Experimental Section 
hereinbelow The results of these analyses illustrate that the characterization of molecular 
diversity with MLVA to the strain-typing of Mycobacterium tuberculosum method can be 
10 used for the rapid discrimination and identification of remaining major Mycobacterium 
tuberculosum species. They also illustrate that the sub-typing scheme useful for the 
epidemiological study of Mycobacterium tuberculosum may be applied to the local 
detection of the pathological causative agent of tuberculosis. 

[0049] Kits are herein provided for use with commercially available PCR 

1 5 instruments to detect and sub-type strains of Mycobacterium tuberculosum. The kits 
contain one or more primer pairs disclosed hereinabove having SEQ ID NOS: 1-1 8 for 
amplifying the VNTR in DNA isolated from a Mycobacterium tuberculosum sample. If 
the sample is to be multiplexed, the kits may contain* a suitable "cocktail" of primer pairs. 

[0050] The kits may also contain nucleic acids needed in the amplification 

20 process. The nucleic acids may be tagged by a suitable marker, a fluorescent probe or a 
radioactive molecule. Any tag for marking the nucleic acid after amplification and size 
separation as by electrophoresis or other separation means is suitable. In certain 
preferred embodiments of the invention, the primer pairs themselves comprise a suitable 
marker. 

25 [0051] The kits may also comprise enzymes, taq polymerase, for example and 

salts and buffers suitable for causing amplification of DNA by PCR. This kits may also 
comprise suitable containers and bottles for housing these reagents and or convenient use. 

[0052] Kits for sub-typing strains of Mycobacterium tuberculosum comprise, in 

addition, DNA isolated from known Mycobacterium tuberculosum strains. This isolated 
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DNA containing VNTR loci may be used as standards in the sub-typing of the species. 
In certain instances the kits may comprise in addition primers and standard references for 
MIRU analysis. 

EXPERIMENTAL SECTION 

5 

[0053] The following experiments illustrate the materials and methods used in 

genotyping M tuberculosum isolates by MVLA with the primers of the present invention. 

Source of Mycobacterium tuberculosum strains. 

10 

[0054] As part of the Centers for Disease Control and Prevention (CDC) National 

Tuberculosum Genotyping and Surveillance Network, the University of Texas Health 
Science Center at San Antonio (UTHSCSA) serves as a genotyping reference laboratory 
for M tuberculosum isolates collected by the Texas Department of Health (TDH). From 

15 1996 to present, isolates from 4,269 persons with TB were genotyped (4, 15). The 

Institutional Review Boards of the TDH and the UTHSCSA approved culture collection 
and analyses (TDH-CDC cooperative agreement, TDH protocol number 980025, and 
UTHSCSA protocol number 001-6000-105). Two sets of clinical isolates were analyzed. 
One set consisted of 57 isolates having four IS61 10 RFLP bands. The second set 

20 included 34 isolates with an identical Beijing spoligotype pattern containing spacer 
oligonucleotides 36-43 and 14 different IS61 10 RFLP patterns. 

DNA isolation 

[0055] M. tuberculosum isolates were grown on Lowenstein- Jensen media slants 

(Becton, Dickinson, and Co., Franklin Lakes, NJ) for four to six weeks at 37°C under 5% 

25 CO2. In a BioSafety Level 3 facility, the bacterial mass was removed using a sterile 

inoculating loop, placed in a microcentrifuge tube containing 1ml ddH20, and heat-killed 
at 85°C for 20min. The cells were pelleted by centrifugation and resuspended in a lOmM 
Tris-HCl, 1 mM EDTA buffer (pH 8.0). Cell walls were digested with lysozyme (10 
mg/ml), proteinase K (10 mg/ml), and 10% sodium dodecyl sulfate. DNA was extracted 

30 using 0.3 M cetyltrimethylammonium bromide and 5M NaCl, purified by chloroform- 
isoamyl alcohol separation, and precipitated using isopropanol (Sigma, St. Louis, MO). 
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Spoligotyping 

[0056] Spoligotype analyses (10) were done using locally synthesized and 

biotinylated PCR primers (UTHSCSA Advanced Nucleic Acids Technology Core 
Facility, San Antonio, TX) and commercially available spoligotyping membranes 
5 (Isogen, Maarssen, The Netherlands) as previously described (15). 

1S61 10RFLP analysis. 

[0057] IS61 10-based restriction fragment length polymorphism (RFLP) was 

performed by Southern blotting of PvuII-digested genomic DNA using a 523 bp (base 
10 pair) right-handed IS61 10 probe (23), the ECL™ detection system (Amersham, 

Piscataway, New Jersey), and Biolmage Whole Band Analyzer version 3.4.2 (Genomic 
Solutions, Ann Arbor, MI) as previously described (15). 

VNTR identification and primer design 

1 5 [0058] The complete genome sequence of the Mycobacterium tuber culosum 

H37Rv strain was downloaded from the Sanger Centre (WWW.sanger.ac.uk). Potentially 
polymorphic repetitive sequences were identified using the DNAstar software program 
Genequest (Lasergene, Inc Madison,- WI). Selection criteria of repetitive sequence were 
set for nucleotide repeat motifs of more than 8bp within 1 00 bp proximity of each other. 

20 Primers were designed around 84 repetitive sequences identified using the DNAstar 

software program Primer Select (Lasergene, Inc Madison, WA). Complementary primers 
were designed around interspersed repeats to minimize risk of mobile DNA targets. 
Fifteen of these repetitive sequences were found to be polymorphic, i.e., VNTR loci. 
Nine loci were chosen for use in these analyses (Table. 1) 

25 

PCR amplification of VNTR loci 

[0059] PCR reactions were performed using a total volume of 10 jaL. All PCR 

reagents used were obtained from InVitrogen (Madison, WI) unless otherwise indicated. 
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Each PCR reaction mix contained the following reagents and concentrations: lxPCR 
buffer (20 mM Tris-HCL [pH 8.4] 5 50 mM KCL); 2mM MgCl 2 > 200uM of each four 
deoxynucleoside triphosphates (dNTPs), 0.4 Units of Platinum Taq Polymerase (Gibco- 
Life Technologies) per uL, 0.2uM forward primer, 0.2uM reverse primer Rl 10, R6G, or 
5 Tamara phosphoramide fluorescent labeled oligonucleotides (Perkin Elmer Biosystems), 
1 uL DNA template, and ddH 2 0 bringing the total volume to 10 uL. Each lOul PCR mix 
was then denatured for 5 minutes at 95°C. Following the denaturing, the samples were 
cycled 35 times through the following program: 95°C for 20 seconds, 55°C for 25 
seconds, and 72°C for 20 seconds. These 35 cycles were followed by a final extension at 
10 72°C for 5 minutes and the samples were then stored at -20°C until genotyped. Reactions 
that failed under the above conditions were then repeated with 2ul of DNA in a lOul 
reaction; all other concentrations and conditions remaining the same. 



Genotyping 

1 5 [0060] Detailed automated genotyping methods have been described previously 

(Keim, P., L. B. Price, A. M. Klevytska, K. L. Smith, J. M. Schupp, R. Okinaka, P. J. 
Jackson, and M. E. Hugh-Jones. 2000. Multiple-locus variable-number tandem repeat 
analysis reveals genetic relationships within Bacillus anthracis. Journal of Bacteriology. 
182:2928-36)., herein incorporated in its entirety. Briefly, fluorescent labeled amplicons 

20 were sized and scored using the ABI software program Genotyper. 

Statistical Analysis 

[0061] Genetic distances were determined by calculating the percentage shared . 

alleles pair-wise among all isolates. The clustering method used was unweighted pair 
25 group method arithmetic average (UPGMA). UPGMA clustering analysis was performed 
using the software program NTSYS (16). There was no missing data across the nine 
VNTR loci and 91 isolates. The diversity index of each VNTR marker was calculated 
using [1-Z (allele frequencies) 2 ] (25). 
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EXAMPLE 1 

[0062] This example illustrates the amplicon produced during the amplification of 

VNTR locus Mtb-vl with primer pairs SEQ ID NO:' 1 and SEQ ID NO: 2. 

5' Beginning location of VNTR: 55,533. (All accession numbers are according to the 
5 H37Rv M. tuberculosum sequence done by the Sanger Institute. 
http://vyww.sanger.ac.uk/) 

GGCTCGGGCCTGCCGTCGGACATCTGGAAGGCAACCATGGACGGCGCCTT 

GAAGGGCACGTCGAACGAGACTTTCCCCAAACCGACCGAGGTCGGTGGTT 

ATGCCGGTGTGCCGCCGCCGCCGCCGCCGCCGGAGGTACCACCTTCGCAG 

1 0 ACCGTCATCCAGCCCACGGTCGAAATTGCGCCGGGGATTACCATCCCGATC 

GGTCCCCCGACCACCATTACCCTGGCGCCACCGCCCCCGGCCCCGCCCGCTG 

CGACTCCCACGCCGCCGCCGTGACCGGCGCGCTGTCCCAAAGCAGCAACATC 

TCGCCACTTCCTTTGGCCGCCGATCTGCGGAGCGCCGATAACCGCGATTGCCC 

CAGCCGCACCGACGTATTGGG 

15 FORWARD PRIMER 5' GTCGAACGAGACTTTCCCCAAACCGAC 3' 
REVERSE PRIMER 5* GACCGTGGGCTGGATGACGGTCTC 3* 

EXAMPLE 2 

[0063] This example illustrates the amplicon produced during the amplification of 

20 VNTR locus Mtb-v3 with primer pairs SEQ ID NO: 3 and SEQ ID NO: 4. 

5' Beginning location of VNTR 241,464. (All accession numbers are according to the 
H37Rv M. tuberculosum sequence done by the Sanger Institute. 
http://www.sanger.ac.uk/' ) 

5' GCGAGCCTACCGAAGATCGCGTGCATGCGTTCGGCGTGGACCGCACAGCA 
25 CCTGGAGTTGGCGGCGCCGAGGGCCGAGATGGCAGGATGACCGATCGTCG 
GGGGCGGGAACTCCCAGGCCGCCGGGCCGTCGCAAACCCGTCGCAAACC 
CGTCGCAAACCGTAAGGAGTCATCCATGAAGACAGGCACCGCGACGACGCG 
GCGCAGGCTGTTGGCAGTACTGATCGCCCTCGCGTTGCCGGGGGCCGCCG 
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TTGCGCTGCTGGCCGAACCATCAGCGACCGGCGCGTCGGACCCGTGCGCGGC 

C AGCG AAGTGGCGAGG ACGGTCGGTTCGGTCGCC AAGTCGATGGGCGACT 3 ' 

FORWARD PRIMER 5' GATGACGGATCGTCGGGGGCGGGAAC 3' 
REVERSE PRIMER 5* GCAACGCGAGGGCGATCAGTACTGCCAACA 3' 

5 

EXAMPLE 3 

[0064] This example illustrates the amplicon produced during the amplification of 

VNTR locus Mtb-v4 with primer pairs SEQ ID NO: 5 and SEQ ID NO: 6. 

5' Beginning location of VNTR -241,464. (All accession numbers are according to the 
10 H37Rv M. tuberculosum sequence done by the Sanger Institute. 
http://www.sanger.ac.uk/') 

ATTCAGGCCGCCGAATCCACTACCGATGATGACCACGCGATGGCGCCCGC 

CGACGGCCGAGGGTTCACCAGATGAGAGCGTCATGGTCCTCCTTCAGTCTGG 

TCGCTGTGGCGCAGCTACACAGTACGACTCCCGTCATGCCAACGGCGTAA 

1 5 CTTTTTGTGGGCCTTGTGGGCCTTGTGGGCCTTGTGGGCCTTTGTCGGGC 
CGCCTTCGGATCGGACGCTCGGGATGGCTGTTGGGCGCTGCGCAATCCCGC 
GCTTCGATCAGGCAGCGTCCGGCAGTGCCATCAATGGCGGCCAGGTACACCT 
CTCCGACGGCTCGACATCGCCGGCCCGGCAGTTACCTGCACCATGGCCGGGC 
GATGCGGGAGCGGCTGCCGAAGGTCGGGCAGGTGTTTGCTGCCGGGGAAATC 

20 GACTACCACATGTTTCAGACGTTGGTGTATCGCACCGATTTGATCACCGACCC 

GCAGGTGTTGGCGCGGGTGGATGCCGAGCTGGCGCTGCGGGTGCGGGGCT 

FORWARD PRIMER 5' GCTGTGGCGCAGCTACACAGTACGACTC 3' 
REVERSE PRIMER 5' GATTGCGCAGCGCCCAACAGC 3' 

25 

EXAMPLE 4 

[0065] This example illustrates the amplicon produced during the amplification of 

VNTR locus Mtb-v5 with primer pairs SEQ ID NO: 7 and SEQ ID NO: 8. 
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5' Beginning location of VNTR 241,464. (All accession numbers are according to the 
H37Rv M. tuber culosum sequence done by the Sanger Institute. 
http://www.sanger.ac.uk/) 

AGCCAGCAGAGCTTCGAGGAAGTGTCGGCGCGCGCCGAAGGCTACGTGG 

5 ACCAGGCGGTGGAGTTGACCCAGGAGGCGTTGGGTACGGTCGCATCGCAG 

ACCCGCGCGGTCGGTGAGCGTGCCGCCAAGCTGGTCGGCATCGAGCTGCCTA 

AGAAGGCTGCTCCGGCCAAGAAGGCCGCTCCGGCCAAGAAGGCCGCTCC 

GGCCAAGAAGGCGGCGGCCAAGAAGGCGCCCGCGAAGAAGGCGGCGGC 

CAAGAAGGTCACCCAGAAGTAGTCGGGCTCCGAATCACCATCGACTCCGA 

1 0 GTCGCCCACGGGGCGACTCGGAGTCGACGTGTTGGATGCAAACCGCATAGTC 

TGAATGCGTGAGCCACCTCGTGGGTACCGTCATGCTGGTATTGCTGGTCGCCG 

TCTTGGTGACAGCGGTGTACGCGTTTGTGCATGCTGCGTTGCAGCGGCCCGAT 

GCCTATACCGCCGCCGACAA 

FORWARD PRIMER 5* GGAGGCGTTGGGTACGGTCGCATC 3' 
15 REVERSE PRIMER 5' GATTCGGAGCCCGACTACTTCTGGGTGAC 3' 

EXAMPLE 5 

[0066] This example illustrates the amplicon produced during the amplification of 

VNTR locus Mtb-v6 with primer pairs SEQ ID NO: 9 and SEQ ID NO: 10. 

20 5' Beginning location of VNTR 1,112,923. (All accession numbers are according to the 
H37Rv M. tuberculosum sequence done by the Sanger Institute. 
http://www.sanger.ac.uk/') 

GCAGACCGCAGGTGCCGACGAGCCGGACTACTTAGACGTCGATGTGGTC 

GAAGAAGACTCGGAGGCGCTTCCGGTGGGGGCTGGCGCTGCGGTCGGCGAGT 

25 CCGCCGACGAGGCCGATGCCGAAGCTGCTGACGGAGTTGCGGGCCACGCC 

GACCCGGAGGCCGACCCGGTCGAATACGAATACGAATACGAATACGTCGA 

GGACACCTGCGGTTTGGAGCTCGAGGAGGACGACCAGGAAGCGCCACCGAC 

CGTCGCATCCGGCACGTCACGGCGGCGCCGATTCGACACCAAGACCGCCGCC 
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GCGGTCAGCGCCCGCAAGTACACCTTCCGCAAACGTGCGTTGATCGTGATGG 

CGGTGATCCTGGTTGGCTCTGCCGCCGCGGCCTTCGAGCTGACCCCGGTCG 

CGTGGTGGATCTGTGGTAGCGCCACCGGTGTGACGGTGCTCTACCTGGCATAT 

TTGCGTCGGCAAACCCGCATCGAGGAGAAGGTGCGTCGGCGGCGGATGCAG 

5 CGGATCGCGCGGGCGCGGCTCGGTGTAGAGAACACCCGTG 

FORWARD PRIMER 5' CGCCGACGAGGCCGATGCCGAAGC 3' 
REVERSE PRIMER 5' CCGCGGCGGCAGAGCCAACCAGGAT 3* 

EXAMPLE 6 

10 This example illustrates the amplicon produced during the amplification of VNTR 

locus Mtb-vlO 

Beginning location of VNTR 2,604,155. (All accession numbers are according to the 
H37Rv M. tuberculosum sequence done by the Sanger Institute. 
http://www.sanger.ac.uk/') 

1 5 ATCCTTGCAGGTGTTCGGTGGTGACGGTCGTGTTGTCGGCGGCTTCCAGG 
CGCAACCGGAGGCGCCCAGCCCCACAATCAGTGCCAACAGTGCCAACAG 
TGCCAACAGTGCCAGAATCGGGACGGCGCTACGCTGACGACGCACGTCACG 
AGCTTAGCGAAAACTGGGAATTTCCCCTACGTTTCATCAACGCCTCAGGTGT 
CGATCCTAAAGCGCGGGTGCCGCCGGTATTCTTGCCCCAAATCGGTCGGTT 

20 GACACCCGATGCGGTCGGCGAAGCCATCGGCATCGCGGCCGACGACATCCCG 

ATGGCGGCACGCTGGATCGGCA.GCCGACCATGCTCGCTCATCGGCCAGCC 

FORWARD PRIMER 5* GGAGGCGCCCAGCCCCACAA 3' 
REVERSE PRIMER 5' TCAGGTGTCGATCCTAAAGCGCGGGTG 3' 

25 EXAMPLE 7 

[0067] This example illustrates the amplicon produced during the amplification of 

VNTR locus Mtb-vl5 with primer pairs SEQ ID NO: 13 and SEQ ID NO: 14. 
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[0068] Beginning location of VNTR 3,238,462. (All accession numbers are 

according to the H37Rv M. tuberculosum sequence done by the Sanger Institute. 
http://www.sanger.ac.uk/ ) 

CCGCGCCGATCGGTTCCGTTGATCCGCGCTCCCCGCAGCCGCGCCGTCAGA 
5 TCCGCGGGCCCGGCATCGTGGCGGCGCACAGCGCGGGGGTGGTGCACCCGA 
ACCTCGGTGATGGTCCGGCCGGTCACGTGAGCCTGCAAGCCGCGCCGCACC 
ACCTCGACTTCGGGCAGCTCGGGCATCCAGTGATGATCGCAAGCGCGGCG 
AAGCCGGGCGCAGCGGGTCATCACCATCGAACCAGTGATGATCGCAAGC 
GCGGCGAAGCCGGGCGCAGCGGGTCATCACCATCGAACCAGTGATGAT 

1 0 CGC AAGCGCGGCGAAGCCGGGCGCAGTCCCCCGCAAGCGGG AGGTGCC 
CCCAGGTCATCACCATCGAACCAGTGATCATCGCAAGCGCGGCGAAGCC 
GGGCGCAGTCCCCCCCAAGCGGGAGGTGCCCCCAGGTCATCACCATCG 
AACCAGTGATGATCGCAAGCGCGGCGAACCCGGCCGCAGTCCCCCGCA 
AGCGCGGCAAAGCCGGCGCCCCCAGGTCATCACCATCAATCCAGTTAGGCG 

1 5 GAGGTTTTGCCCGGCATGGCGTTGTCGAGC ACTTCC AGGGCTTTCC AAGCG 

GCCGCCGCGGCTTTTTGCTCGGCTTCTTTTTTGGACCGGCCCACTCCTGAAC 

FORWARD PRIMER 5' GCGCCGCACCACCTCGACTT 3* 
REVERSE PRIMER 5' GTTAGGCGGAGGTTTTGCCCGG 3' 

EXAMPLE 8 

20 [0069] This example illustrates the amplicon produced during the amplification of 

VNTR locus Mtb-vl8 with primer pairs SEQ ID NO 15 and SEQ ID NO 16. 

5' Beginning location of VNTR 4,227,024. (All accession numbers are according to the 
H37Rv M. tuberculosum sequence done by the Sanger Institute. 
http://www.sanger.ac.uk/) 

25 CGGGCTATCACCGGTTCATCTTCTGGGCCTATGGCCGCAACGGGGAGAGCG 

ATCCTCGGGTACTCGAGGCCATCCAAGTCCTCCGTATCCGCTATATCCTGACC 
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AGCACTCCGACGGTGCGGGGGTTTGCCGTGCCGGACGGACTAGTGTCGTTAG 

AGACATCGAGGTCGTGGGCGAAGATCTACGACAACGGCGAGGCCCGAATC 

TACGAATGGCGCGGCACTGCCGCAGCAACACACTCCTAGAAGGTGCGTAAG 

AGGATGGTGATTGGATTGAGTACCGGCAGCGACGACGACGACGTCGAGGTCA 

5 TCGGCGGCGTCGACCCGCGGCTGATAGCGGTGCAGGAGAACGACTCCGACG 

AGTCGTCGCTGACCGACCTGGTCGAGCAGCCCGCCAAGGTGATGCGCATCGG 

CACCATGATCAAGCAACTGCTCGAGGAGGTTCGCGCCGCCCCACTCGACGAA 

GCCAGCCGCAATCGGCTACGCGATATCCACGCCACCAGCATCCGCGAACTC 

GAAGATGGTCTGGCCCCGGAACTGCGCGAGGAGCTCGACCGGCTTACCCTGC 

1 0 CGTTCA ACGAGGACGCCGTGCCCTCGGACGCCGAGTTGCGCATTGCCC AGGC 

ACAGCTGGTCGGCTGGCTGGAAGGGCTGTTCCACGGCATCCAAA 

FORWARD PRIMER 5' ACAACGGCGAGGCCCGAATCTACGAA 3' 
REVERSE PRIMER 5* ATAGGTGCGGTGGTCGTAGGCGC 3' 

15 EXAMPLE 9 

[0070] This example illustrates the amplicon produced during the amplification 

of VNTR locus Mtb-v20 with primer pairs SEQ ID NO 17 and SEQ ID NO 18. 
Beginning location of VNTR 23,693. (All accession numbers are according to the 
H37Rv M. tuberculosum sequence done by the Sanger Institute. 
20 http://www.sanger.ac.uk/) 

AGCGTATCCCGGGACCCGCGGGGATTCGGTGTAGCGGGTGTAGTCAGCGCC 

GCCGCCATAGTCTTGCCGGCCGTATGTCGTGGCGCCTTGCTGGTAGCCCTGGT 
CGTAACCGCCTTGGTCGGGGTAAGCCGGTCGTTGCTCGGGCGGGCCCGGAG 
GGCCAGAGGGCACATAGCTGCCCTCCTCGTGGCGAGCCGGGCCACGCCCGT 
25 ACTCCCCGTACCCGCCGTAGCCGGGCTGGCCGCCACCGGGTGAAGGGCCGTA 
GCCGCCGCTTTGGCGATAGCCCTGGTCGTAGCCGGGAGCGCCGTAGCCGGCA 
GCCGGGCCGGGAGAAACAGGAGGGCGTTGCTCGTAGGGCGGCGGATAGCCC 
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CCCTGCCCTTGGTCGGGGTAGCCTCGACCCTGGTCCTGGTACCCGCGCTGGTC 

GGGGTAGCCGCGTTGCTCGGGGTAACCGCGTTGCTCGGGGTAACCGCCC 

TGGTCGGGGTACCCGATTTGCTCGGGGTAGTCGCCCTGGTCCGGGTGGCGCG 

GGCGTGGGTAGCCCGGCTGGGGCGGGTAGCGGCCCGTCTCGGGTGGATACCC 

5 CCCGCGGGGGTCAGATCCGCCTTGCGGATCCGGGCCACCACGCGGATCCTCT 

TGCGGACGCGCATAGCGGTCGTCGTAATACTCGTCGGGACGCCCCTGCCCCT 

GACCGCCACGGTAGCTCGAATTGTCACTCATTGGTGCTACTCCTGGTTCTG 

CGCCAAACGCGTGGTTTGATTGTGGCCGGGCGCAATCGATGACCGGCGG 

FORWARD PRIMER 5' CCCGGAGGGCCAGAGGGCACATAGC 3' 
10 REVERSE PRIMER 5' GTCACTCATTGGTGCTACTCCTGGTTCTGCGCCA 
3' 

[0071] While certain of the preferred embodiments of the present invention have 

been described and specifically exemplified above, it is not intended that the invention be 
1 5 limited to such embodiments. Various modifications may be made thereto without 

departing from the scope and spirit of the present invention, as set forth in the following 
claims. 



-26- 



Anonymous. 2002. Global Tuberculosis Control Report. World Health 
Organization, pg.6 

Bifani PJ, M. B., Kurepina NE, Kreiswirth BN. 2002. Global dissemination of 
the Mycobacterium tuberculosis W family strains. Trends in Microbiology 10:45- 
52. 

Braden, C. R., J. T. Crawford, and B. A. Schable. Assessment of 

Mycobacterium tuberculosis Genotyping in a Large Laboratory Network. 

Emerging Infectious Diseases. 8(1 1): 12 10-5, 2002 November. 

Castro, K. G., and H. W. Jaffe. 2002. Rationale and methods for the national 

tuberculosis genotyping and surveillance network. Emerging Infectious Diseases. 

8:1188-91. 

Cowan, L. S., L. Mosher, L. Diem, J. P. Massey, and J. T. Crawford. 2002. 
Variable-Number Tandem Repeat Typing of Mycobacterium tuberculosis Isolates 
with Low Copy Numbers of IS6110 by Using Mycobacterial Interspersed 
Repetitive Units. Journal of Clinical Microbiology 40:1592-602. 
Dale, J. W., D. Brittain, A.A. Cataldi, D. Cousins. J.T. Crawford, J. Driscoll, 
H. Heersma, T. Lillebaek, T.N. Quitugua, N. Rastogi, D. van Soolingen, V. 
Wright. 2001. Spacer oligonucleotide typing of Mycobacterium tuberculosis: 
recommendations for standardized nomenclature. Int. J. Tuberc. Lung Dis. 5:216- 
219. 

Fang, Z., D. T. Kenna, C. Doig, D. N. Smittipat, P. Palittapongarnpim, B. 
Watt, and K. J. Forbes. 2001. Molecular evidence for independent occurrence of 
IS6J10 insertions at the same sites of the genome of Mycobacterium tuberculosis 
in different clinical isolates. Journal of Bacteriology. 183:5279-52847. 
Farlow, J., K. L. Smith, J. Wong, M. Abrams, M. Lytle, and P. Keim. 2001. 
Francisella tularensis strain typing using multiple-locus, variable-number tandem 
repeat analysis. Journal of Clinical Microbiology. 39:3186-92. 
Frothingham, R., and W. A. Meeker-CVConnell. 1998. Genetic diversity in the 
Mycobacterium tuberculosis complex based on variable numbers of tandem DNA 
repeats. Microbiology 144: 1 1 89-96. 

Kamerbeek J., L. S., A. Kolk, M. van Agterveld, D. van Soolingen, S. 
Kuijper, A. Bunschoten, H. Molhuizen, R. Shaw, M. Goyal, J. D.A van 
Embden. 1997. Simultaneous detection and strain differentiation of 
Mycobacterium tuberculosis for diagnosis and epidemiology. J. Clin. Microbiol. 
35:907-914. 

Keim, P., L. B. Price, A. M. Klevytska, K. L. Smith, J. M. Schupp, R. 
Okinaka, P. J. Jackson, and M. E. Hugh-Jones. 2000. Multiple-locus variable- 
number tandem repeat analysis reveals genetic relationships within Bacillus 
anthracis. Journal of Bacteriology. 182:2928-36. 

Kremer, K., D. van Soolingen, R. Frothingham, W. H. Haas, P. W. Hermans, 
C. Martin, P. Palittapongarnpim, B. B. Plikaytis, L. W. Riley, M. A. Yakrus, 
J. M. Musser, and J. D. van Embden. 1999. Comparison of methods based on 



-27- 



